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RAW SEQUENCE LISTING DATE: 03/3 0/20 

PATENT APPLICATION: US/09/804 , 626 TIME: 15:02:5" 



Input Set : A:\PTO.txt U™ B Lij 

Output Set: N:\CRF3\03302001\I804626.raw *— • 

3 <110> APPLICANT: Lobel, Leslie 

4 Lustbader, Joyce 

6 <120> TITLE OF INVENTION: EXPRESSION OF PROPERLY FOLDED AND SOLUBLE EXTRACELLULAR 

DOMAIN OF A 

7 GONADOTROPIN RECEPTOR 

9 <130> FILE REFERENCE: 0575/62259/ JPW/SHS 
C--> 11 <140> CURRENT APPLICATION NUMBER: US/09/804,626 
C--> 11 <141> CURRENT FILING DATE: 2001-03-09 

11 <160> NUMBER OF SEQ ID NOS : 8 

13 <170> SOFTWARE: Patentln version 3.0 

15 <210> SEQ ID NO: 1 

16 <211> LENGTH: 1557 

17 <212> TYPE: DNA 

18 <213> ORGANISM: HOMO SAPIENS 

20 <400> SEQUENCE: 1 

21 atgagcgata aaattattca cctgactgac gacagttttg acacggatgt actcaaagcg 



60 



23 gacggggcga tcctcgtcga tttctgggca gagtggtgcg gtccgtgcaa aatgatcgcc 120 
25 ccgattctgg atgaaatcgc tgacgaatat cagggcaaac tgaccgttgc aaaactgaac 
27 atcgatcaaa accctggcac tgcgccgaaa tatggcatcc gtggtatccc gactctgctg 
29 ctgttcaaaa acggtgaagt ggcggcaacc aaagtgggtg cactgtctaa aggtcagttg 
31 aaagagttcc tcgacgctaa cctggccggt tctggttctg gccatatgca ccatcatcat 
33 catcattctt ctggtctggt gccacgcggt tctggtatga aagaaaccgc tgctgctaaa 420 
35 ttcgaacgcc agcacatgga cagcccagat ctgggtaccg acgacgacga caaggccatg 480 
37 gctgatatcg gatccgaatt ccgagcgctg cgcgaggcgc tctgccctga gccctgcaac 540 
39 tgcgtgcccg acggcgccct gcgctgcccc ggccccacgg ccggtctcac tcgactatca 600 
41 cttgcctacc tccctgtcaa agtgatccca tctcaagctt tcagaggact taatgaggtc 660 
4 3 ataaaaattg aaatctctca gattgattcc ctggaaagga tagaagctaa tgcctttgac 720 
45 aacctcctca atttgtctga aatactgatc cagaacacca aaaatctgag atacattgag 
4 7 cccggagcat ttataaatct tccccgatta aaatacttga gcatctgtaa cacaggcatc 
4 9 agaaagtttc cagatgttac gaaggtcttc tcctctgaat caaatttcat tctggaaatt 
51 tgtgataact tacacataac caccatacca ggaaatgctt ttcaagggat gaataatgaa 
53 tctgtaacac tcaaactata tggaaatgga tttgaagaag tacaaagtca tgcattcaat 
55 gggacgacac tgacttcact ggagctaaag gaaaacgtac atctggagaa gatgcacaat 1080 
57 ggagccttcc gtggggccac agggccgaaa accttggata tttcttccac caaattgcag 1140 
59 gccctgccga gctatggcct agagtccatt cagaggctaa ttgccacgtc atcctattct 1200 
61 ctaaaaaaat tgccatcaag agaaacattt gtcaatctcc tggaggccac gttgacttac 1260 
63 cccagccact gctgtgcttt tagaaacttg ccaacaaaag aacagaattt ttcacattcc 1320 
65 atttctgaaa acttttccaa acaatgtgaa agcacagtaa ggaaagtgaa taacaaaaca 1380 

6 7 ctttattctt ccatgcttgc tgagagtgaa ctgagtggct gggactatga atatggtttc 1440 
69 tgcttaccca agacaccccg atgtgctcct gaaccagatg cttttaatcc ctgtgaagat 1500 
71 attatgggcg tcgacaagct tgcggccgca ctcgagcacc accaccacca ccactga 1557 

74 <210> SEQ ID NO: 2 

75 <211> LENGTH: 518 

76 <212> TYPE: PRT 

77 <213> ORGANISM: Homo Sapiens 

7 9 <400> SEQUENCE: 2 

81 Met Ser Asp Lys He He His Leu Thr Asp Asp Ser Phe Asp Thr Asp 

82 1 5 10 15 



180 
240 
300 
360 



780 
840 
900 
960 
1020 
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84 


Val : 


Leu Lys Ala Asp Gly Ala He Leu Val Asp ] 


Phe Trp Ala Glu Trp 


85 






20 








25 








30 




87 ■ 


Cys < 


Gly Pro Cys : 


Lys Met He Ala Pro lie Leu Asp Glu He Ala Asp 


88 




35 








40 








45 






90 < 


Glu ■ 


Tyr Gin Gly : 


Lys Leu Thr Val Ala Lys Leu Asn : 


lie Asp Gin Asn 


91 


50 








55 








60 








93 


Pro i 


Gly Thr Ala : 


Pro Lys Tyr Gly He Arg Gly : 


lie 1 


Pro Thr Leu Leu 


94 


65 








70 








75 








80 


96 


Leu 


Phe Lys Asn < 


Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser 


97 








85 








90 








95 


99 


Lys i 


Gly Gin Leu : 


Lys Glu Phe Leu Asp Ala Asn : 


Leu Ala Gly Ser Gly 


100 








100 










105 










110 




102 


Ser 


Gly 


His 


Met 


His 


His 


His 


His 


His 


His 


Ser 


Ser 


Gly 


Leu 


Val Pro 


103 






115 










120 










125 






105 


Arg 


Gly 


Ser 


Gly 


Met 


Lys 


Glu 


Thr 


Ala 


Ala 


Ala 


Lys 


Phe 


Glu 


Arg Gin 


106 




130 










135 










140 








108 


His 


Met 


Asp 


Ser 


Pro 


Asp 


Leu 


Gly 


Thr 


Asp 


Asp 


Asp 


Asp 


Lys 


Ala Met 


109 


145 










150 










155 








160 


111 


Ala 


Asp 


He 


Gly 


Ser 


Glu 


Phe 


Arg 


Ala 


Leu 


Arg 


Glu 


Ala 


Leu 


Cys Pro 


112 










165 










170 










175 


114 


Glu 


Pro 


Cys 


Asn 


Cys 


Val 


Pro 


Asp 


Gly 


Ala 


Leu 


Arg 


Cys 


Pro 


Gly Pro 


115 








180 










185 










190 




117 


Thr 


Ala 


Gly 


Leu 


Thr 


Arg 


Leu 


Ser 


Leu 


Ala 


Tyr 


Leu 


Pro 


Val 


Lys Val 


118 






195 










200 










205 






120 


He 


Pro 


Ser 


Gin 


Ala 


Phe 


Arg 


Gly 


Leu 


Asn 


Glu 


Val 


He 


Lys 


He Glu 


121 




210 










215 










220 








123 


He 


Ser 


Gin 


He 


Asp 


Ser 


Leu 


Glu 


Arg 


He 


Glu 


Ala 


Asn 


Ala 


Phe Asp 


124 


225 










230 










235 








240 


126 


Asn 


Leu 


Leu 


Asn 


Leu 


Ser 


Glu 


He 


Leu 


He 


Gin 


Asn 


Thr 


Lys 


Asn Leu 


127 










245 










250 










255 


129 


Arg 


Tyr 


He 


Glu 


Pro 


Gly 


Ala 


Phe 


He 


Asn 


Leu 


Pro 


Arg 


Leu 


Lys Tyr 


130 








260 










265 










270 




132 


Leu 


Ser 


He 


Cys 


Asn 


Thr 


Gly 


He 


Arg 


Lys 


Phe 


Pro 


Asp 


Val 


Thr Lys 


133 






275 










280 










285 






135 


Val 


Phe 


Ser 


Ser 


Glu 


Ser 


Asn 


Phe 


He 


Leu 


Glu 


He 


Cys 


Asp 


Asn Leu 


136 




290 










295 










300 








138 


His 


He 


Thr 


Thr 


He 


Pro 


Gly 


Asn 


Ala 


Phe 


Gin 


Gly 


Met 


Asn 


Asn Glu 


139 


305 










310 










315 








320 


141 


Ser 


Val 


Thr 


Leu 


Lys 


Leu 


Tyr 


Gly 


Asn 


Gly 


Phe 


Glu 


Glu 


Val 


Gin Ser 


142 










325 










330 










335 


144 


His 


Ala 


Phe 


Asn 


Gly 


Thr 


Thr 


Leu 


Thr 


Ser 


Leu 


Glu 


Leu 


Lys 


Glu Asn 


145 








340 










345 










350 




147 


Val 


His 


Leu 


Glu 


Lys 


Met 


His 


Asn 


Gly 


Ala 


Phe 


Arg 


Gly 


Ala 


Thr Gly 


148 






355 










360 










365 






150 


Pro 


Lys 


Thr 


Leu 


Asp 


He 


Ser 


Ser 


Thr 


Lys 


Leu 


Gin 


Ala 


Leu 


Pro Ser 


151 




370 










375 










380 








153 


Tyr 


Gly 


Leu 


Glu 


Ser 


He 


Gin 


Arg 


Leu 


He 


Ala 


Thr 


Ser 


Ser 


Tyr Ser 


154 


385 










390 










395 








400 


156 


Leu 


Lys 


Lys 


Leu 


Pro 


Ser 


Arg 


Glu 


Thr 


Phe 


Val 


Asn 


Leu 


Leu 


Glu Ala 
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1 










40S 

*i \J -J 










410 










415 




1 ^ Q 

jld y 


Tnx 


Leu 


Til IT 


Tyr 


Pro 


C a y 
OCX 


Hi g 
illo 


^ J 






tr lie 


Arg 


Asn 


Leu 


Pro 


Thr 


160 








420 










425 










430 






162 


Lys 


Glu 


Gin 


Asn 


Phe 


Ser 


His 


Ser 


lie 


Ser 


Glu 


Asn 


Phe 


Ser 


Lys 


Gin 


163 






435 










440 










445 








165 


Cys 


Glu 


Ser 


Thr 


Val 


Arg 


Lys 


Val 


Asn 


Asn 


Lys 


Thr 


Leu 


Tyr 


Ser 


Ser 


166 




450 










455 










460 










168 


Met 


Leu 


Ala 


Glu 


Ser 


Glu 


Leu 


Ser 


Gly 


Trp 


Asp 


Tyr 


Glu 


Tyr 


Gly 


Phe 


169 


465 










470 










475 










480 


171 


Cys 


Leu 


Pro 


Lys 


Thr 


Pro 


Arg 


Cys 


Ala 


Pro 


Glu 


Pro 


Asp 


Ala 


Phe 


Asn 


172 










485 










490 










495 




174 


Pro 


Cys 


Glu 


Asp 


lie 


Met 


Gly 


Val 


Asp 


Lys 


Leu 


Ala 


Ala 


Ala 


Leu 


Glu 


175 








500 










505 










510 






177 


His 


His 


His 


His 


His 


His 























178 
180 



515 



<210> SEQ ID NO: 3 

181 <211> LENGTH: 1551 

182 <212> TYPE: DNA 

183 <213> ORGANISM: Homo Sapiens 

185 <400> SEQUENCE: 3 

186 atgagegata aaattattca cctgactgac gacagttttg 
188 gaeggggega tcctcgtcga tttctgggca gagtggtgcg 
190 ccgattctgg atgaaatege tgacgaatat cagggcaaac 
192 atcgatcaaa accctggcac tgcgccgaaa tatggcatcc 
194 ctgttcaaaa acggtgaagt ggcggcaacc aaagtgggtg 
196 aaagagttcc tegaegctaa cctggccggt tctggttctg 
198 catcattctt ctggtctggt gccacgcggt tctggtatga 

ttcgaacgcc agcacatgga cagcccagat ctgggtaccg 
gctgatatcg gatccgaatt cagggttttt ctctgccaag 
ccttctgacc tcccgaggaa tgccattgaa ctgaggtttg 
atccaaaaag gtgeatttte aggatttggg gacctggaga 
gatgtcttgg aggtgataga ggcagatgtg ttctccaacc 
agaattgaaa aggecaacaa cctgctctac atcacccctg 
aaccttcaat atctgttaat atccaacaca ggtattaagc 
attcattctc tccaaaaggt tttacttgac attcaagata 
gaaagaaatt ctttcgtggg gctgagcttt gaaagtgtga 
gggattcaag aaatacacaa ctgtgcattc aatggaaccc 
agegataata ataatttaga agaattgect aatgatgttt 
222 gtcattctag atatttcaag aacaaggatc cattccctgc 
224 cttaagaagc tgagggccag gtcgacttac aacttaaaaa 
226 cttgtcgccc tcatggaagc cagcctcacc tatcccagcc 
228 tggagaegge aaatctctga gcttcatcca atttgeaaca 
2 30 gttgattata tgactcaggc taggggtcag agatcctctc 
2 32 agctacagca gaggatttga catgaegtae actgagtttg 
234 gtggttgacg tgacctgctc ccctaagcca gatgeattea 
236 ggggtcgaca agettgegge cgcactcgag caccaccacc 

239 <210> SEQ ID NO; 4 

240 <211> LENGTH: 516 

241 <212> TYPE: PRT 



200 
202 
204 
206 
208 
210 
212 
214 
216 
218 
220 



acaeggatgt 
gtccgtgcaa 
tgaccgttgc 
gtggtatccc 
cactgtctaa 
gecatatgea 
aagaaacege 
acgacgacga 
agagcaaggt 
tcctcaccaa 
aaatagagat 
ttcccaaatt 
aggccttcca 
accttccaga 
acataaacat 
ttctatggct 
aactagatgc 
tccacggagc 
ctagctatgg 
agctgcctac 
attgctgtgc 
aatctatttt 
tggcagaaga 
actatgactt 
acccatgtga 
accaccactg 



actcaaagcg 
aatgatcgee 
aaaactgaac 
gaetctgetg 
aggtcagttg 
ccatcatcat 
tgctgctaaa 
caaggecatg 
gacagagatt 
gcttcgagtc 
ctctcagaat 
acatgaaatt 
gaaccttccc 
tgttcacaag 
ccacacaatt 
gaataagaat 
agtgaatcta 
ctctggacca 
cttagaaaat 
tctggaaaag 
etttgeaaac 
aaggcaagaa 
caatgagtcc 
atgcaatgaa 
agatatcatg 
a 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1551 
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242 <213> ORGANISM: Homo Sapiens 
244 <400> SEQUENCE: 4 

246 Met Ser Asp Lys He He His Leu Thr Asp Asp Ser Phe Asp Thr Asp 

247 i 5 10 15 

249 Val Leu Lys Ala Asp Gly Ala He Leu Val Asp Phe Trp Ala Glu Trp 

250 20 25 30 

252 Cys Gly Pro Cys Lys Met He Ala Pro He Leu Asp Glu He Ala Asp 

253 35 40 45 

2 55 Glu Tyr Gin Gly Lys Leu Thr Val Ala Lys Leu Asn He Asp Gin Asn 
256 50 55 60 

2 58 Pro Gly Thr Ala Pro Lys Tyr Gly He Arg Gly He Pro Thr Leu Leu 
259 65 70 75 80 

2 61 Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser 
262 85 90 95 

264 Lys Gly Gin Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Gly 

265 100 105 HO 

267 Ser Gly His Met His His His His His His Ser Ser Gly Leu Val Pro 

268 115 120 125 

270 Arg Gly Ser Gly Met Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gin 

271 130 135 140 

273 His Met Asp Ser Pro Asp Leu Gly Thr Asp Asp Asp Asp Lys Ala Met 

274 145 150 155 160 

276 Ala Asp He Gly Ser Glu Phe Arg Val Phe Leu Cys Gin Glu Ser Lys 

277 165 170 175 

279 Val Thr Glu He Pro Ser Asp Leu Pro Arg Asn Ala He Glu Leu Arg 

280 180 185 190 

282 Phe Val Leu Thr Lys Leu Arg Val He Gin Lys Gly Ala Phe Ser Gly 

283 195 200 205 

285 Phe Gly Asp Leu Glu Lys He Glu He Ser Gin Asn Asp Val Leu Glu 

286 210 215 220 

288 Val He Glu Ala Asp Val Phe Ser Asn Leu Pro Lys Leu Hxs Glu He 

289 225 230 235 240 

291 Arg He Glu Lys Ala Asn Asn Leu Leu Tyr He Thr Pro Glu Ala Phe 

292 245 250 255 

294 Gin Asn Leu Pro Asn Leu Gin Tyr Leu Leu He Ser Asn Thr Gly He 

295 260 265 270 

297 Lys His Leu Pro Asp Val His Lys He His Ser Leu Gin Lys Val Leu 

298 275 280 285 

300 Leu Asp He Gin Asp Asn He Asn He His Thr He Glu Arg Asn Ser 

301 290 295 300 

303 Phe Val Gly Leu Ser Phe Glu Ser Val He Leu Trp Leu Asn Lys Asn 

304 305 310 315 320 

306 Gly He Gin Glu He His Asn Cys Ala Phe Asn Gly Thr Gin Leu Asp 

307 325 330 335 

309 Ala Val Asn Leu Ser Asp Asn Asn Asn Leu Glu Glu Leu Pro Asn Asp 

310 340 345 350 

312 Val Phe His Gly Ala Ser Gly Pro Val He Leu Asp He Ser Arg Thr 

313 355 360 365 

315 Arg He His Ser Leu Pro Ser Tyr Gly Leu Glu Asn Leu Lys Lys Leu 
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316 




370 










375 










380 










318 


Arg 


Ala 


Arg 


Ser 


Thr 


Tyr 


Asn 


Leu 


Lys 


Lys 


Leu 


Pro 


Thr 


Leu 


Glu 


Lys 


319 


385 










390 










395 










400 


321 


Leu 


Val 


Ala 


Leu 


Met 


Glu 


Ala 


Ser 


Leu 


Thr 


Tyr 


Pro 


Ser 


His 


Cys 


Cys 


322 










405 










410 










415 




324 


Ala 


Phe 


Ala 


Asn 


Trp 


Arg 


Arg 


Gin 


He 


Ser 


Glu 


Leu 


His 


Pro 


He 


Cys 


325 








420 










425 










430 






327 


Asn 


Lys 


Ser 


He 


Leu 


Arg 


Gin 


Glu 


Val 


Asp 


Tyr 


Met 


Thr 


Gin 


Ala 


Arg 


328 






435 










440 










445 








330 


Gly 


Gin 


Arg 


Ser 


Ser 


Leu 


Ala 


Glu 


Asp 


Asn 


Glu 


Ser 


Ser 


Tyr 


Ser 


Arg 


331 




450 










455 










460 










333 


Gly 


Phe 


Asp 


Met 


Thr 


Tyr 


Thr 


Glu 


Phe 


Asp 


Tyr 


Asp 


Leu 


Cys 


Asn 


Glu 


334 


465 










470 










475 










480 


336 


Val 


Val 


Asp 


Val 


Thr 


Cys 


Ser 


Pro 


Lys 


Pro 


Asp 


Ala 


Phe 


Asn 


Pro 


Cys 


337 










485 










490 










495 




339 


Glu 


Asp 


He 


Met 


Gly Val 


Asp 


Lys 


Leu 


Ala 


Ala 


Ala 


Leu 


Glu 


His 


His 


340 








500 










505 










510 






342 


His 


His 


His 


His 



























343 515 

345 <210> SEQ ID NO: 5 

346 <211> LENGTH: 2100 

347 <212> TYPE: DNA 

348 <213> ORGANISM: Homo Sapiens 

350 <400> SEQUENCE: 5 

351 atgaagcagc ggttctcggc gctgcagctg ctgaagctgc tgctgctgct gcagccgccg 60 
353 ctgccacgag cgctgcgcga ggcgctctgc cctgagccct gcaactgcgt gcccgacggc 120 
35 5 gccctgcgct gccccggccc cacggccggt ctcactcgac tatcacttgc ctacctccct 180 
357 gtcaaagtga tcccatctca agctttcaga ggacttaatg aggtcataaa aattgaaatc 240 
359 tctcagattg attccctgga aaggatagaa gctaatgcct ttgacaacct cctcaatttg 300 
361 tctgaaatac tgatccagaa caccaaaaat ctgagataca ttgagcccgg agcatttata 3 60 
363 aatcttcccc gattaaaata cttgagcatc tgtaacacag gcatcagaaa gtttccagat 420 
365 gttacgaagg tcttctcctc tgaatcaaat ttcattctgg aaatttgtga taacttacac 480 
367 ataaccacca taccaggaaa tgcttttcaa gggatgaata atgaatctgt aacactcaaa 540 
369 ctatatggaa atggatttga agaagtacaa agtcatgcat tcaatgggac gacactgact 600 
371 tcactggagc taaaggaaaa cgtacatctg gagaagatgc acaatggagc cttccgtggg 660 
373 gccacagggc cgaaaacctt ggatatttct tccaccaaat tgcaggccct gccgagctat 720 
375 ggcctagagt ccattcagag gctaattgcc acgtcatcct attctctaaa aaaattgcca 7 80 
377 tcaagagaaa catttgtcaa tctcctggag gccacgttga cttaccccag ccactgctgt 840 
37 9 gcttttagaa acttgccaac aaaagaacag aatttttcac attccatttc tgaaaacttt 900 
381 tccaaacaat gtgaaagcac agtaaggaaa gtgaataaca aaacacttta ttcttccatg 960 
383 cttgctgaga gtgaactgag tggctgggac tatgaatatg gtttctgctt acccaagaca 1020 
385 ccccgatgtg ctcctgaacc agatgctttt aatccctgtg aagatattat gggctatgac 1080 
387 ttccttaggg tcctgatttg gctgattaat attctagcca tcatgggaaa catgactgtt 1140 
389 ctttttgttc tcctgacaag tcgttacaaa cttacagtgc ctcgttttct catgtgcaat 1200 
391 ctctcctttg cagacttttg catggggctc tatctgctgc tcatagcctc agttgattcc 1260 
393 caaaccaagg gccagtacta taaccatgcc atagactggc agacagggag tgggtgcagc 1320 
395 actgctggct ttttcactgt attcgcaagt gaactttctg tctacaccct caccgtcatc 1380 
397 actctagaaa gatggcacac catcacctat gctattcacc tggaccaaaa gctgcgatta 1440 
399 agacatgcca ttctgattat gcttggagga tggctctttt cttctctaat tgctatgttg 1500 
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